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The search for allelic variants that cause 

monogenic disorders or predispose 

to common, complex polygenic phenotypes 
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concerning mutant alleles of genes and their related phe- 
notypes. In this review, I will briefly summarize some 
general principles regarding the search for genes (more 
specifically, mutant alleles of these genes) that either 
cause the various human genetic disorders or confer 
predisposition to common, complex phenotypes. 

Monogenic disorders 

There are a large number of phenotypes (each of which 
is rare in the population) due to abnormal mutant alle- 
les of single genes. These disorders are usually called 
monogenic since there is one gene of paramount impor- 
tance related to the development of the phenotype; con- 
sequently, these phenotypes show a mendelian mode 
of inheritance. For a particular disorder, the mapping of 
the responsible gene could easily be determined by 
studying the transmission of polymorphic markers 
within a family. Positional candidate gene-cloning 
strategies could then be employed to identify the 
responsible gene by virtue of mutations (nucleotide 
sequence variants) present in a patient's D NA and not 
in controls.The genetic methodology identified a route 
to understanding the molecular basis of disease, which 
otherwise seemed intractable. On May 25, 2001, the 
knowledge-based database 0 M I M (0 nline M endelian 
Inheritance in Man)^'^ contained 1168 mutant genes 
linked to human monogenic disorders. Several notable 
examples of neurological disorders are shown in Table I , 
which lists disease genes, their corresponding pheno- 
types, and the years of the linkage mapping and their 
positional cloning. The first mutant gene-disease link 
discovered by positional cloning strategies was that of 
chronic granulomatous disease in 1986. 



The search for the mutant genes for monogenic disorders 
has been a spectacular success This was accomplished 
because of the mapping and sequencing of the human 
genome, the determination of the sequence variability, the 
collection of well-characterized families with mendelian 
disorders^ the development of statistical methodsfor link- 
age analysis^ and laboratory methodsfor mutation search. 
The challenge of the genetic medicine is now to decipher 
the nucleotide sequence variants that predispose to com- 
mon complex, polygenic phenotypes The methodology for 
thischallenge isin development and constant evolution. It 
isanticipated that, in the next 10 to 20 years, susceptibility 
alleles for these common disorders will be identified. 



T 



he principal aim of the field of genetic medi- 
cine is to discover the links between nucleotide sequence 
variation in the human genome and the various human 
phenotypes. The methodologies of linkage analysis and 
mutation detection, along with progress in the mapping 
and sequencing of the human genome and that of model 
organisms, resulted in a plethora of exciting discoveries 
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Gene 


Disease 


Mapping Cloning 


HD 


Huntington chorea 


1983 


1993 


NFl 


Neurofibromatosis, type 1 


1987 


1991 


ATI 


Ataxia-telangiectasia 


1988 


1995 


SCN4A 


Hyperl<alemic periodic paralysis 


1990 


1991 


VHL 


Von Hippel-Lindau syndrome 


1988 


1993 


DRPLA 


Dentatorubral-pallidoluysian atrophy 1993 


1993 


SMNl 


Spinal muscular atrophy 


1990 


1995 


SODl 


Amyotrophic lateral sclerosis, familial 1991 


1993 


PSl 


Alzheimer'sdisease, early-onset 


1992 


1995 


PS2 


Alzheimer'sdisease, early-onset 


1995 


1995 


FA 


Friedreich ataxia 


1988 


1996 


PARK2 


Parkinson's disease, juvenile 


1997 


1998 



Table I. Partial list of selected mutant genes that cause monogenic 
neurological disorders. 



The work of numerous investigators in botli academia 
and tlie biotechnology industry over the last 20 years or 
so has provided the infrastructure necessary to perform 
studies of linkage mapping and gene identification of 
genetic disorders.A large number of polymorphic mark- 
ers due to variable units of short sequence repeats 
(SSRs) have been identified throughout the entire 
genome and used to create linkage maps of all human 
chromosomes These highly polymorphic markers in turn 
provided the tools to localize the unknown disease- 
related genes to intervals of the genome.The most com- 
mon of these SSRs are dinucleotide repeats such as 
(GT)pi or (AT)n. Dense linkage maps that determined 
the position, order, and distance of adjacent SSR s have 
been produced by genotyping the DNAs of appropri- 
ate large families including those collected and distrib- 
uted by CEPH (Centre d'Etude de Polymorphism 
Humain).^ A "draft" sequence of the entire human 
genome has been obtained and is publicly available." 
A pproximately 40% of this sequence is already finished 
(ie, of high quality, ordered, and practically gapless), 
including that of the chromosomes 22 and 21." The 
availability of the nucleotide sequence has two impor- 
tant consequences for identifying disease loci. First, the 
recognition of a total of approximately 35 000 genes will 
now greatly facilitate and accelerate gene-disease 
matchmaking. Second, the discovery of more than 2 mil- 
lion single nucleotide polymorphisms (SN Ps)^" will likely 
result in the recognition of the functional nucleotide 
sequence variability that is associated with common 
complex phenotypes 



The story of positional identification 
of disease-related alleles 

Let me now describe a typical project to identify the 
mutant gene associated with a monogenic disorder 
(Figure 1), for example, the autosomal dominant H unt- 
ington disease gene. Dominant means that a mutation 
in only one allele of an autosomal gene is needed in 
order to manifest the disorder. I n contrast, in a reces- 
sive disease, mutations in both alleles of an autosomal 
gene are needed for the phenotypic expression of the 
disease. M utations in genes on the X or Y sex chro- 
mosomes are associated with X- or Y-linked pheno- 
types. H owever, no matter the mode of inheritance, 
the general strategy to identify the causative gene 
mutation(s) is similar. 

Collection of families 

The initial phase of the project is to identify families with 
the precise phenotypic characteristics of the disease, and 
establish that the number of individuals available for 
study provides the appropriate power in linkage analysis 
to identify the disease gene location. The collection of 
samples from affected and unaffected members of the 
families is then justified, after approval of the study by 
the local human experimentation ethics committees and 
informed consent. For H untington disease, members of a 
large family from M aracai bo, Venezuela, were collected,' 
but the biomedical literature is full of other interesting 
family collections from different parts of the world and 
different geoethnic communities. The best population 
groups for rare autosomal recessive disorders are those 
in which consanguineous marriages are common, or 
those originated from a few founders. I wish to empha- 
size the importance of accurate diagnosis of affected and 
nonaffected individuals in order to perform errorless 
linkage analysis. A spects of the phenotype that have to 
be taken into account include the age of onset and clini- 
cal variation; the participation of a clinical expert is there- 
fore of paramount importance in the development and 
success of the project. 

Linkage analysis 

The next phase isthe performance of linkage analysisto 
localize the yet unknown "disease gene" to a small 
genomic region. This linkage analysis is based on the 
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Figure 1. Schematic representation of a genetic (map-based) approach to identify mutant alleles involved in monogenic and complex phenotypes. 
The human genome is shown as a double straight line in the middle. The top panel shows a simplified strategy for monogenic disorders. 
The responsible gene is mapped by linkage analysis to a small genomic interval (shown in red); subsequently positional identification of 
the pathogenic mutation is based on the nucleotide sequence information. The bottom panel shows a simplified strategy for complex, 
polygenic disorders and traits. The predisposing alleles are mapped by linkage and association studies to several large regions of the 
genome (shown in blue). It is expected that sequence-based association studies will uncover some of the predisposing alleles. SNP, sin- 
gle nucleotide polymorphism. 



Identification of D NA polymorphic markers that coseg- 
regate with the disease phenotype. The D N A markers, 
which constitute part of the normal nucleotide variabil- 
ity of the genome, usually fall into two categories as 
mentioned: the SSRs and the SNPs. For the linkage 
analysis studies, the most useful markers are SSR s since 
they are highly polymorphic. There are more than two 
different (usually six) alleles per SSR marker in the pop- 
ulation, and they are therefore informative in the major- 
ity of the families. M ost of the successful linkage map- 
ping studies have used approximately 300 such markers 
equally distributed throughout the genome with an aver- 
age interval of 10 cM , or 10% recombination between 
adjacent markers. Note that this distance is measured 
in genetic terms, ie, in recombination units in human 
meiosis; 1 cM on average corresponds to approximately 
1000 kb or lO'* nucleotides of D N A . A fter the use of suf- 



ficient markers, the success of a linkage mapping project 
in a monogenic phenotype depends on: 

• T he size of the fami I ies and the D N A s avai lable for study. 
It is imperative to perform a linkage simulation analy- 
sis of the available sample to determine if there is suf- 
ficient statistical "power" to detect linkage. 

• The accuracy of the diagnosis Problems arise when 
affected individuals are categorized as normal or vice 
versa (due to inability to detect the manifestations of 
the phenotype, or reduced "penetrance," ie, the 
absence of phenotype in spite of the presence of the 
mutant gene or late onset of the phenotypic charac- 
teristics). 

• T he extent of the genetic heterogeneity of the phenotype. 
1 1 is much easier to map the disease locus if the pheno- 
type is always due to mutations in the same gene. I n 
contrast, it is much more difficult to map loci for disor- 
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ders/phenotypes that result from mutant alleles of 
more than one gene. A n example of genetic homo- 
geneity is H untington disease, in which all affected 
pedigrees are due to mutations in the same gene on 
chromosomal region 4p/° I n contrast, tuberous sclero- 
sis shows genetic heterogeneity. There are two genes, 
TSCl and TSC2 on chromosomes 9 and 16, respec- 
tively; mutations in each result in the same phenotype 
of tuberous sclerosis" 
The next step after the localization of a disease- related 
locus to a particular genomic interval is to narrow down 
this region to an area of approximately 1 to 2 megabases 
(M b) (1000 000-2000 000 nucleotides). Data provided 
by the preliminary annotation of the human genome 
sequence indicate that, on average, there are 9 genes per 
megabase, with a range between different chromosomes 
from 5 to 23." 

The narrowing down of the "critical region" is there- 
fore important and can generally be achieved by two 
methods. The first is to identify critical recombination 
events between certain D NA markers and the disease 
phenotype in the families examined. This is achieved by 
the addition of affected families and by studying a large 
number of markers in the critical region. 1 1 is advisable 
to rely mostly on recombinants in the D NA s of affected 
individuals. 

The second approach takes advantage of linkage dis- 
equilibrium (LD ), that is the historical recombinants 
between the disease mutation and the polymorphic vari- 
ants surrounding the mutation. The extent of L D or 
allelic association usually defines the area of the disease 
locus. "0 Id" mutations show a short region of LD ; more 
recent, "young," mutations are obviously associated with 
a large region of LD because there were only few 
meioses and generations to restrict the area of L D . L D is 
useful in autosomal recessive disorders with consan- 
guinity, or founder effect autosomal dominant and X- 
linked disorders with ancient mutations I n contrast, LD 
is not contributory in dominant or X -linked disorders 
with many different and recent (only a few generations) 
mutant alleles. 

Positional identification of tlie patliogenic allele 

The next phase requires a search for mutant alleles of 
genes that map within the critical interval. The method- 
ology of this search for the elusive gene has changed 
most dramatically in the last 12 to 15 years. The 



advances of the human genome project provide a pub- 
licly available genomic infrastructure that becomes 
more detailed every year. I n the mid-1980s, it was nec- 
essary to complete the physical map of the critical 
region, ie, to develop an overlapping set of cloned 
human DNAsthat covered the entire critical region. 
Then, it was necessary to identify portions of all genes 
in the critical interval, clone the entire cDNAs, and 
determine the intron-exon junctions and their genomic 
structure.A II of the above steps have now been largely 
accomplished by the international collaboration and 
competition that is collectively called the human 
genome project. This extraordinary project provided a 
dense linkage map,^ a complete physical map of the 
genome,^^^^ a large number of partial gene sequences," 
and, this year, the almost entire human genome 
sequence."^ A t the time of writing (M ay 2001), there 
exists in the public database a sequence of the human 
genome that consists of about 40% finished high-qual- 
ity sequence and -50% draft sequence of lower quality 
with numerous gaps and unordered DNA fragments. 
Less than 10% of the human genomic sequence is still 
unknown. There are two chromosomes, namely 22 and 
21, for which the sequence is complete with only mini- 
mal gaps."There are now catalogues of well-character- 
ized and predicted genes in the entire genomic land- 
scape. These catalogues are freely available and 
constantly updated, for example, see the websites of the 
N ational Center for B iotechnology I nformation and the 
E nsembi project. ^^^^ It is likely therefore that a sub- 
stantial number of genes have already been identified in 
the critical genomic interval that harbors the elusive 
gene for the monogenic disorder of interest. The list of 
genes is by no means complete and the characteriza- 
tion of each gene is far from adequate; the genomic 
efforts in the next couple of years will be directed 
towards the structural and functional characterization 
of all human genes. A t present, however, the unfinished 
gene list per genomic interval provides a wealth of can- 
didates to search for mutations in the patients' DNA 
or R NA . The methodologies for mutation characteri- 
zation have also changed considerably in the last few 
years. Direct sequencing after polymerase chain reac- 
tion (PCR) amplification of exons, intron-exon junc- 
tions, 5'-untranslated region (UTR),and 3'-UTR usu- 
ally detects the majority of mutations that may cause a 
disease phenotype. Certainly, there are exceptions, and 
sequencing after PCR amplification may miss muta- 
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tions. A large, heterozygous deletion, for example, that 
eliminates one or more exons, may go undetected with- 
out a quantitative PC R , or by hybridization after South- 
ern blotting. In some cases, it is necessary to separate 
the two different alleles in vitro and search for muta- 
tions in isolated alleles. 

What is usually the result of a mutation search in an 
"average" positional cloning project? A considerable 
number of sequence variants are identified in the genes 
sequenced. Since the frequency of polymorphic variants is 
about 1 in 1300 nucleotides between two randomly cho- 
sen chromosomes approximately 40 nucleotide differ- 
ences are expected to be found in the genes of the 1-M b 
critical region between normal and affected individuals 
This is based on the fact that about 2% of the genome is 
a coding region of genes, and with the inclusion of 
intron-exon junctions and 5'- and 3'-UTR, approximately 
4% of the 1-M b region is usually sequenced in patients 
The next important question is: which one of these 
nucleotide differences is associated with (or is responsible 
for) the disease phenotype? Several criteria could be 
employed in order to focus on some of the differences. 

• The presence of the variant nucleotide in normal indi- 
viduals from the same ethnic group as the affected indi- 
viduals is normally sufficient to consider these changes as 
nonpathogenic variation. U sually D NA sfrom about 100 
individuals are examined for rare and common variants 

• The predicted consequence of the mutation is impor- 
tant. Nucleotide differences resulting in nonsense 
codons or translational frameshifts, or severe splicing 
defects, are more likely to be pathogenic. 

• D e novo mutations, not present in the parents, partic- 
ularly in sporadic cases of dominant or X -linked dis- 
orders are more likely to be pathogenic. 

• The most difficult to interpret nucleotide changes are 
those that result in amino-acid substitutions (missense 
mutations). Substitutions in evolutionarily well-con- 
served amino acids among homologous proteins in dif- 
ferent species are excellent candidates for pathogenic 
mutations. 

• M utations that are predicted to alter the function of 
the protein or have been experimentally demonstrated 
to do so are excellent candidates 

• Certain mutations need to be tested in model organ- 
isms in order to study their effect. 

• Other mutations require long-term epidemiological 
studies to prove their involvement with a disease phe- 
notype. 



The study of the molecular basis of the disease pheno- 
type in unrelated pedigrees and the demonstration of 
mutations in the same gene often confirm the involve- 
ment of this gene in the disease. 
The description of studies to elucidate the function of 
the disease- related protein and the pathogenetic mech- 
anism of the disease is beyond the scope of this article. 1 1 
is, however, important to emphasize that the evolution- 
ary conservation of genes makes model organisms 
(yeast, worm, fruitfly zebrafish, or mouse) indispensable 
tools for the functional analysis of human genes. 
The methodology described above for gene cloning 
responsible for monogenic disorders has been repeat- 
edly successful.^ A considerable number of disease- 
related genes and alleles have been identified in the last 
15 years. The 0 M I M contains 1168 genes with mutant 
alleles associated with disease phenotypes. M ost of these 
have been identified using positional cloning efforts 
without any previous knowledge of the biochemistry or 
pathophysiology of the disease phenotype. 

Functional gene variants 
for predisposition to common, 
complex, phenotypes 

One of the greatest challenges of this decade for bio- 
medicine is to identify the mutant/polymorphic alleles 
that cause or predispose to common human disorders 
with a strong genetic component. It is not far from the 
truth if we state that the entire effort for the mapping, 
sequencing, and determination of the normal variability 
of our genome has been done in order to be able to 
find the mutant alleles of the common, complex phe- 
notypes. These phenotypes include disorders such as 
schizophrenia and bipolar disease, diabetes asthma, ath- 
erosclerosis, multiple sclerosis, obesity, hypertension, 
A Izheimer's disease, aging, and susceptibility to infec- 
tious diseases. The tasks appear enormous, but the 
expected benefits for medicine could be so profound 
that are certainly worth the effort and expenses from 
both academia and industry. 

The discovery of predisposing mutant alleles for com- 
mon disorders is nevertheless very difficult. A Ithough 
we do not understand all the reasons for this difficulty, 
we could certainly mention the following points 
First, the inheritance of the common complex pheno- 
types is not clearly mendelian. 1 1 is true that there is an 
aggregation of affected individuals in certain families. 
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but the mode of inheritance is not compatible with the 
usual recognizable patterns. 0 n the other hand, there is 
strong evidence for a genetic predisposition for these 
complex disorders": (i) the frequency of these disorders 
in relatives of affected individuals is much greater than 
in the general population; and (ii) the frequency of con- 
cordance in monozygotic twins is always greater than in 
dizygotic twins. For some disorders such as schizophre- 
nia, adoption studies are also in favor of genetic predis- 
position. I n all of these studies, the underlying hypothe- 
sis in favor of genetic predisposition is that people who 
share a greater proportion of alleles have a higher prob- 
ability of manifesting the disease in their lifetimes. 
M onozygotic twins share 100% of their alleles; siblings 
and dizygotic twins share 50% of their alleles; first 
cousins share an eighth of their alleles. Therefore, for a 
complex disease with genetic predisposition, the proba- 
bility of developing the disease is greater in a monozy- 
gotic twin of an affected individual, less in a sibling of an 
affected individual, and even less in a first cousin. The 
fact that the inheritance pattern of these disorders is not 
mendelian renders the use of parametric linkage analy- 
sis difficult or impossible, since these studies require a 
fixed mode of inheritance. 

Second, the phenotype may be uncertain. This is clearly 
evident in the psychiatric disorders in which the diag- 
noses are based only on clinical criteria. The danger of 
misdiagnosis, or mi sclassifi cation, is therefore consid- 
erable. In addition, the age of onset of a phenotype is 
variable even within the same family; this makes it dif- 
ficult to categorize unaffected individuals as truly life- 
time unaffected. A further problem isthat there islikely 
genetic heterogeneity in affected individuals within the 
same family.This is because most of these disorders are 
common and it is therefore possible to have affected 
individuals due to the contribution of mutant alleles of 
different genes. For example, it is not unusual to find 
individuals with breast cancer not related to the B RCA 1 
gene belonging to families with well-documented 
BRCA 1-related breast cancer. After all, breast cancer is 
common since it affects approximately 10% of females 
in their lifetime. 

Third, it is possible that each of the predisposing mutant 
alleles has a minor effect on the phenotype and that 
several mutant alleles from different genes in concert 
result in a pathological phenotype. I n addition, most of 
the predisposing mutant alleles may be common poly- 
morphic variants in the population. U nlike the successes 



of the monogenic disorders, we know of very few 
mutant alleles that predispose to common, complex 
polygenic disorders Such examples include the A PO E 4 
allele, which predisposes to A Izheimer's disease,^" and 
factor V Leiden, which predisposes to deep venous 
thrombosis." 

The most important challenge to the genetic medicine in 
the next decade is certainly to uncover the mutant al- 
leles that predispose to the complex common disorders. 
The advances of the sequences of the human and other 
genomes, and the discovery of the polymorphic vari- 
ability among human genomes, provide the necessary 
infrastructure to tackle these problems. 1 n addition, the 
development of rapid, high-throughput, inexpensive, and 
reliable methods for mutation detection will also con- 
tribute to these discoveries. Finally, the availability of 
samples from a large number of parents, their family 
members, and controls is also another necessary com- 
ponent in this endeavor. 

The methods that exist today for the mapping of pre- 
disposing alleles (PDA s) could be summarized as fol- 
lows (Figure l).^"'^^ 

L inkage analysis methods 

The parametric methods with fixed mode of inheritance 
could still be used in the large rare families segregating a 
complex phenotype. Furthermore, linkage projects usually 
involve small families with complex disorders, in which 
case all possible modes of inheritance should be tested. 
The nonparametric methods also known as model-free 
methods are certainly more suitable for complex pheno- 
typesThese methods score the "amount" of shared alleles 
among affected individualsThe most widely used method 
isthat of sibling pairs. In this potentially interesting alle- 
les are those that are shared in siblings in frequencies sta- 
tistically different from the expected 50% .A large num- 
ber of affected siblings are necessary (most studies used 
100-200 such pairs) and their power to reveal linkage 
increases when the D NA s of their parents are available. 
There are several variations of this method, since all 
affected relatives could be used and nonaffected individ- 
uals could also provide valuable information. M ost of the 
studies with sibpairs use SSR markers because ideally all 
four parental alleles could be recognized. For these stud- 
ies a genome-wide scan usually requires approximately 
300 polymorphic markers placed in the average intervals 
of average size 10 cM . 
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Transmission disequilibrium test 

This test, which is in between linl<age and association, 
estimates the difference between the alleles transmit- 
ted and nontransmitted to patients from their parents 
The null hypothesis for a noncontributing locus is that 
there is no difference between transmitted and non- 
transmitted alleles. I n this method, single affected indi- 
viduals and their parents could be used (usually referred 
as trios). M ost studies have used about 100 such trios. 
The advantage of this method is that it utilizes a power- 
ful internal control of the nontransmitted alleles from 
the same population as the affected alleles. 

Association studies 

The most simple study used to determine the implication 
of a mutant allele to a phenotype is that of association of 
the polymorphic allele to the phenotype.The polymorphic 
allele (usually an SN P) could be PDA , or be within a very 
short genomic distance from the PDA .The question then 
is: how close could an SN P be to the PDA in order to still 
detect an association, or LD ? The answer to this ques- 
tion is complex and depends on the number of the differ- 
ent PDA s to the phenotype, the age of the mutant alleles 
the frequency of the mutant alleles in the population, the 
heterogeneity of the phenotype, and the history and size 
of the population. For example, the average blocks of L D 
in E uropean populations is approximately 60 kb around 
common polymorphisms, whereas in the older African 
human populations these blocks are much smaller.^^ It 
follows that for an association study in the old popula- 
tions one needs many more polymorphic markers than in 
younger populations. The isolated populations may also 
provide an advantage since the blocks of L D may be even 
larger; however, the disadvantage of these populations 
may be that their PDA s are in different genes than those 
of the outbred populations 

M ost of the markers used in LD /association studies are 
SNPs.Theoretically, the most useful SNPsare those that 
change an amino acid, or occur in regions of gene 
expression regulation. M ore than 2 million SN Ps have 
now been identified as part of the genome-sequencing 
effort and a small fraction (0.2%) result in missense 
codons."' 1 1 has been estimated that the average human 
gene shows two to four common variants in the popula- 
tion. It is perhaps more advisable at this stage to con- 
centrate on the SN Ps within genes and their regulatory 



regions as markers for LD /association studies I n addi- 
tion, many investigators recommend the use of two adja- 
cent SNPs in concert as haplotypes (pattern of poly- 
morphic alleles in one chromosome), in order to 
increase the detective power of each site.The samples of 
patients and controls used in association studies have 
to be as identical as possible in terms of genome vari- 
ability; ideally, the two samples need to be drawn from 
the same ethnic group. The sample size is also a matter 
of debate, but larger sample sizes provide more statisti- 
cal power. It may well be necessary to collect samples of 
several thousands of individuals per category The sam- 
ple size largely depends on the contribution of each 
mutant allele to the phenotype, the number of genes 
involved, the age of the mutant alleles and the frequency 
of the mutant PDA sin the unaffected population. 

J oint linl(age and association 

I n this approach, several genomic regions containing 
PDA s are first identified by linkage analyses and then 
LD /association studies are performed in the 10- to 20- 
M b critical regions. It is a common finding that the crit- 
ical intervals from linkage studies of complex disorders 
are 10 times larger than those of monogenic disorders, 
even if considerable numbers of samples are used. In 
addition, linkage and association studies of complex phe- 
notypes reveal several (more than one) areas of sugges- 
tive linkage, some of which could be replicated in sub- 
sequent studies, but others could not be verified. (For a 
recent discussion of the status of such studies in schizo- 
phrenia and bipolar disease, see references 23 and 24, 
respectively.) 

Determination of the predisposing 
mutant alleles 

H ow can we identify the predisposing mutant allele after 
its mapping in the human genome? The answer to this 
question may well be more difficult than the localization 
of thePDA .Let us suppose that thePDA maps to a 100- 
kb region of the genome. Within this interval, there may 
well exist 100 common polymorphic alleles in the popu- 
lation and a substantial number of these would be pre- 
sent in the affected individuals The determination of 
those alleles with a significant contribution to the phe- 
notype may require genotyping of additional affected 
and unaffected individuals from different geoethnic 
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groups, the functional analysis of the variants, and large 
epidemiologic studies It is also possible that different 
alleles in the same gene predispose to the phenotype, 
similar to the situation in which different mutant alleles 
within one gene cause the same monogenic phenotype. 
M odel organisms could also be used to map and clone 
PDA s for common phenotypes. Dueto space limitations, 
the experimental strategies using animal models are not 
discussed here. 

Concluding remarks 

The identification of mutant genes responsible for 
monogenic disorders has been a triumph of medical 
genetics in the last 15 yearsThese discoveries depended 
on the successes of the mapping and sequencing of the 
human genome, and identification of the normal vari- 



La investigacion de variantes de alelos que 
producen trastornos monogenicos o 
predisponen a fenotipos poligenicos, 
comunes y complejos 

La investigacion de genes mutantes para lostras- 
tornos monogenicos ha significado un suceso 
espectacular. Esto se ha podido realizargraciasal 
mapeo e identificacion de la secuencia del geno- 
ma humano, la determinacion de la variabilidad 
de la secuencia, la serie de familias bien caracte- 
rizadascon trastornos mendelianos> el desarrollo 
de metodosestadfsticospara el analisisde enlaces 
y losmetodosde laboratorio para la investigacion 
de mutaciones El desafio actual de la medicina 
genetica esdescifrar las variantes de lassecuen- 
ciasde nucleotidosque predisponen a fenotipos 
poligenicos, comunes y complejos. La metodolo- 
gia para este desafio esta en desarrollo y evolu- 
cion constantes. Se puede anticipar que en los 
proximoslOa 20ahosseran identificados los ale- 
los susceptibles para estos trastornos f recuentes 



ability. These achievements created an environment of 
enthusiasm for further developments high expectations, 
and underestimation of the difficulties that lie ahead in 
the complex, common phenotypes. 
There is a cautious optimism now, in both academia and 
industry for further advances in the identification of these 
functional sequence variants that predispose to the com- 
mon human diseasesThese will certainly continue to rev- 
olutionize medicine and will place genetic medicine at the 
center of the diagnosis and treatment of human disorders 
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La recherche de variants alleliques 
responsables de maladies monogeniques 
ou predisposant aux phenotypes 
polygeniques courants complexes 

La recherche sur les genes mutants des maladies 
monogeniques a ete un succes spectaculaire. 
Celle-ci a ete rendue possible grace a I'etablisse- 
ment de la carte et du sequengage du genome 
humain, a la determination de la variabilite des 
sequences, a la collecte de families bien caracte- 
ristiques porteusesde maladies mendelienneset 
au developpement de methodesstatistiques pour 
I'analyse de liaison ainsi que de methodes de 
laboratoire pour la recherche de mutations. Le 
defi de la medecine genetique est maintenant de 
dechiff rer les variants de sequences nucleotides 
qui predisposent aux phenotypes polygeniques 
courants complexes La methodologie poury par- 
venir est en cours de developpement et en 
constante evolution. II est a prevoirque les genes 
de susceptibilite pour ces maladies courantes 
seront identifies dans les 10 a 20 ansa venir. 
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